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Abstract 



We consider an Ito stochastic differential equation with delay, driven by brownian mo- 
tion, whose solution, by an appropriate reformulation, defines a Markov process X with 
values in a space of continuous functions C, with generator C. We then consider a back- 
ward stochastic differential equation depending on X, with unknown processes (Y,Z), and 
we study properties of the resulting system, in particular we identify the process Z as a 
deterministic functional of X. We next prove that the forward-backward system provides a 
' suitable solution to a class of parabolic partial differential equations on the space C driven 

by £, and we apply this result to prove a characterization of the fair price and the hedging 
strategy for a financial market with memory effects. We also include applications to optimal 
stochastic control of differential equation with delay: in particular we characterize optimal 
controls as feedback laws in terms the process X. 



o 

q ; 1 Introduction 

We will consider stochastic differential equations with delay (SDDEs for short) on a finite interval 
^ ■ of the form 

Hi / dy t = b(t,y t+ .)dt + a(t,y t+ .)dW t , te[0,T], 

\ y e = x(9), 9 G [— r, 0], 

for an unknown process (yt)t^[-r,T] in Here r > is called the delay and we use the notation 
yt+. = (yt+e)ee[-r,o]- It is customary, and convenient, to introduce the space C = C([— r, 0];M n ) 
and the C-valued process X = (Vt) tg r T i defined by 

X t (9) = y t+e , 9e[-r,0) 

With this notation, b(t, •) and a(t, •) are functions defined on C and the equation can be written 

dy t = b(t,X t ) dt + a(t,Xt) dW t , t G [0,T], 
X t = x G C. 

SDDEs are a classical subject: in the standard reference book [T7] (see also [18]) basic re- 
sults are established: existence and uniqueness of solutions, regular dependence on parameters, 
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Markov property of X as a C-valued process, characterization of its generator. In [7] long time 
asymptotics is studied in detail. 

In this paper we will present new results on optimal control problems for SDDEs. More- 
over, since the Markov character of solutions allows for application of dynamic programming 
arguments, we will also prove new results on the corresponding Hamilton-Jacobi-Bellman equa- 
tion. More generally, we will consider a class of semilinear versions of the parabolic Kolmogorov 
equation associated to the process X. This class includes as a very special case some infinite- 
dimensional variants of the Black-Scholes equation for the fair price of an option, of great interest 
in mathematical finance and already considered in [S]. 

The main tool will be the use of techniques from the theory of backward stochastic differential 
equations (BSDEs) in the sense of Pardoux-Peng, first considered in the nonlinear case in the 
paper [23]. We refer to the monographs [8], [22] for an exposition of the basic theory. The BSDE 
approach that we follow consists in addressing equation (II. ip . but with generic initial values 
t £ [0, T] and x G C = C([— r, 0]; R n ), and then coupling with another equation of backward 
type, with unknown processes (Y, Z). More precisely one considers the forward-backward system 



dtff = b{T,X l f) dr + a{T,X l T ' x ) dW T 

rt,X 

X, 

rt,X Yt,X gt,X\ J_ i r?t,X 



dyy 

Yl 



= ^(r,X t T 

4>{x\ 



T€[t,T\ C [0,T], 
dr + Z T T ' x dW T , 



(1.2) 



where i/):[0,T]xCxMxM' ! ->K and 
(deterministic) function v : [0, T] X C — ► 



: C — > R are given functions. One can then define a 
'. setting v(t, x) = Y^' x and prove that 



Y 



t.x 



V [T. 



X?), ZY = V v{r,X^)a(T,X^ 



where Vq is a differential operator defined by 



V v{t,x) = V x v(t,x)({0}) 



;i.3) 



;i-4) 



where the gradient V x v(t,x) at point (t, x) £ [0, T] x C is an element of the dual space C*, 
hence an n-tuple of finite Borel measures on [— r, 0]. Thus, Vov(t,x) is a vector in R n whose 
components are the masses at point of the components of V x v(t, x). 

It turns out that v is a solution of a semilinear parabolic equation of the form: 



' dv(t,x) 
dt 

, v(T,x) = <j)(x) 



Ctv{t, x) = ip(t, x, v(t, x),Vov(t, x) a(t, x)), 
t e [0,T], x e C, 



1.5) 



where Ct is the generator of the Markov process (X*' x ). In the finite-dimensional case, this was 
first proved in [24] for classical solutions of (11. 5j) . If one considers the controlled SDDE 



;i.6) 



dy u s = b(s,X^) ds + a(s,Xf) [h(s,X^u s )) ds + dW s ], s G [t,T], 
X t = x. 



where the solution depends on a control process u(-) taking values in a space U, and h : [0, T] x 
C x U —> is given, and one tries to minimize a cost functional 



J(t,x,u(-)) =E J g(u s ) ds + E(p(X%), 



;i.7) 



2 



where g : U — * [0, oo), then equation (|1.5p is the associated Hamilton-Jacobi-Bellman equation, 
provided the hamiltonian function tft : [0, T] x C x R rf — ► R is defined by the formula 



ip (t, x, z) = inf {g(u) + zh (t, x,u) : u £ U} 



t € [0,T],x eC,ze R d . 



This way we eventually prove that v coincides with the value function of the control problem. 

Although BSDEs were known to be useful tools in the study of control problems and nonlinear 
partial differential equations, applications to infinite-dimensional state spaces are more recent 
and difficult: see e.g. [10], [11] for the case of a Hilbert space, and [16] for some related results on 
Banach spaces. In these papers, as well as in the present one, the solution of (|1.5|) is understood 
in the so-called mild sense. Special difficulties are given by the fact that the state space C is 
used as the state space of the basic stochastic process X. The reason for doing this is to allow 
for great generality on the coefficients b, a of the SDDEs as well as on the cost functional of the 
control problem. For instance the functional (ft occurring in (|1.7p could have the form 



for some g £ C 1 (IR) and some measure /i on [— r, 0]. The special case when fi is supported 
on a finite number of points is of particular interest and could be studied by direct methods, 
but it is included in our results. More generally, if (f)\, . . . ,(j) n are functionals with the form 
(II. 8p corresponding to functions gi,...,g n G C 1 (]R), and if h € C^R"), then the functional 
<f>{z) = h((f>i(z), . . . , cft n (z)), can also be treated by our methods. One could avoid the use of 
the space C by looking at X as a process with values in the space L 2 ([0,T];R n ) instead. This 
was the approach taken in [12j . However, this leads to restrictions on the applicability of the 
corresponding results. 

Optimal control problems for SDDEs have been thoroughly investigated in recent years: we 
refer to the book [4] and the references therein. One of the main results there is the charac- 
terization of the value function as the unique viscosity solution of the Hamilton-Jacobi-Bellman 
equation (jl.5p . This is achieved for controlled equations of more general form than (jl.6p . in par- 
ticular with possibly control-dependent diffusion coefficients. In our paper we assume stronger 
conditions, namely a special form for the control system (jl.6p and differentiability assumptions 
on the data b, a, (ft, ip with respect to the space variable x E C. In this case we are able to prove 
further properties on the solution v, namely its differentiability. Thus, some of our results can 
be viewed as regularity properties, occurring under appropriate assumptions, of the viscosity 
solutions of (jl.5p . However, the existence of the gradient of v is of special interest in optimal 
control theory, since it allows to characterize optimal controls via feedback laws and to prove 
existence of optimal controls after appropriate formulation. 

Parabolic equations on the space C of the form (II. 5p have also been considered for other 
purposes, in particular as a generalization of the Black-Scholes equation for the fair price of an 
option, in case the market models exhibits memory effects, see [5]. In particular, the special 
operator Vo defined in (jl.4[) also occurs in the class of equations considered there. In the same 
spirit in pQ some formulae of Black-Scholes type are proved. Here again the approach based on 
BSDEs gives new results in comparision to [5]. 

The plan of the paper is as follows: in section [2] we introduce notation and review some 
results on SDDEs, adding some precision on regularity properties of the solution, concerning in 
particular their Malliavin derivative. Section [3] is devoted to proving Theorem 13.11 which is the 
key of many subsequent results; here the operator Vo is introduced. In section |4] we present the 
forward-backward system (|1.2p and prove in particular the second formula in (jl.3p . In section [5] 
we study the optimal control problem, we prove in particular the so-called fundamental relation, 




[-r,0] 




zee, 
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we give criteria for optimality of feedback controls and we prove existence of optimal controls in 
the weak sense. Section [6] is devoted to the study of equation (|1.5|) : it is proved that a solution 
exists and it is connected to the solution of the forward-backward system (|1.2p by formula (|1.3j) ; 
in particular it is proved that the value function of the control problem is a solution to this 
equation (in the mild sense); finally, it is shown how (jl.5|) may arise as the a Black-Scholes 
equation in a financial market and we give explicit conditions for its solvability. 



2 Preliminary results on stochastic delay differential equations 
2.1 Notations 

In this paper we consider a complete probability space (O, J 7 , P) and a standard Wiener process 
W = (Wt)t>o with values in M. d . We denote by (J r t)t>o the natural filtration of W augmented 
in the usual way by the sets of P- measure 0. 
For fixed r > 0, we introduce the space 

C = C([-r, 0];R") 

of continuous functions from [— r, 0] to M. n , endowed with the usual norm |/|c = sup^ g r r)0 i | /(#)!• 
We will consider C-valued stochastic processes: for T > we say that a C-valued process 
(X t ) t £[o,T] belongs to the space 5 P ([0, T];C) (1 < p < oo) if its path are C-continuous P-a.s. 
and the norm 

u ' J ' ; te[o,T] te[o,T]9e[-r,o] 

is finite. Here and in the following, if no confusion is possible, we denote the norm of M n , M. d 
and R nd by | • |. 

We next define several classes of differentiable functions between Banach spaces, first intro- 
duced in [10] in connection with stochastic processes, which allow to formulate several regularity 
results in a compact way. 

In the following, if E and K are Banach spaces, we denote by G 1 (E,K) the space of continu- 
ous functions u : E — > K such that: 1) u is continuous; 2) u is Gateaux differentiable on E, with 
Gateaux differential at point x G E denoted by Vit(x) G L(E, K) (the latter being the space of 
bounded linear operators from E to K, endowed with its usual norm); 3) for every h G E, the 
map x — > Vu(x) h is continuous from E to K. We note that the map x — > Vu(x) is not required 
to be continuous from E to L(E,K): if this happens then u is also Frechet differentiable. 

We say that a function v : [0, T] x E — > K belongs to £/ 0,1 ([0, T] x E, K) if: 1) v is continuous; 
2) for every t G [0, T], v(t,-) is Gateaux differentiable on E, with Gateaux differential at point 
x G E denoted by V x v(t, x) G L(E,K); 3) for every h G E, the map (i, x) — > V ' x v(t,x)h is 
continuous from [0, T] x E to K. 

Now suppose E = C([a, 6];P n ), where a,b G M, a < b. We recall that the dual space 
of C([a,6]) is the space of finite Borel measures on [a, b], endowed with the variation norm. 
Identifying E with the product space C([a,b]) n in the obvious way we conclude that the dual 
space E* of E can be identified with the space of n-tuples fi = {Hk)k=ii w here each [i^ is a 
finite Borel measure on [a, b], and the value of [i at an element g = (gk)k=i e C([a,b]) n , where 
gk G C([a, 6]), is denoted 



/ 



n „ 

g{9) ■ n{M) = J2 9k{e)vk{d6). 

a-M fc=1 J[a,b] 



4 



Let v : [0, 7] x C — > R be a function such that v(t, •) is Gateaux differentiable on C for every 
t G [0,T]. Then the gradient V x v(t,x) at point (t,x) G [0,7] x C is an n-tuple of finite Borel 
measures on [— r, 0]. We denote by \V x v(t,x)\ its total variation norm and we define 

V v{t,x) = V x v(t,x)({0}) (2.1) 

i.e., Vov(t,x) is a vector in R n whose components V^v{t,x) (k = 1, ...,n) are the masses at 
point of the components of V x v(t, x). 

Remark 2.1 In the following, a basic role will be played by the space £/ 0,1 ([0,T] x C,R): 
according to the previous definitions, it consists of real continuous functions v on [0, 7] x C such 
that, for every t G [0, 7], v(t, •) is Gateaux differentiable on C, with Gateaux differential at point 
x G C denoted by V x v(t,x) (an n-tuple of finite Borel measures on [— r, 0]), such that the map 

(t, x) -► (V^(t, s), fc) c .,c = / h(9) ■ V x v(t, x)(d9) 

J[-r,0] 

is continuous on [0, 7] x C, for every h G C. 

2.2 Stochastic delay differential equations 

We fix 7 > and we consider the following stochastic delay differential equation for an unknown 
process (yt)te[o,T] taking values in R n : 

f dy t = b{t,y t+ .)dt + a{t,y t+ .)dW t , t e [0,7], , , 

\ ye = x(6), 6 G [— r, 0], ^ 

where yt+. denotes the past trajectory from time t — r up to time t, namely yt+. = (yt+e)ee[-r,o]i 
and r > is the delay. b(t, •) and a(t, •) are functions of the past trajectory of y and they are 
defined on the space of continuous functions, namely b : [0,T]xC ^ R n and a : [0,T]xC ^ R n<i , 
where R nd is identified with L(R rf , R n ) the space of linear operators from R rf to R n . The function 
x G C is the initial condition. We will refer to equation (12. 2D as delay equation. 
We make the following assumptions on the coefficients of ()2.2p . 

Hypothesis 2.2 1. The functions b : [0, T) x C -> M n and cr : [0,T] x C -> R nd are contin- 
uous and there exists a constant K > such that for all t G [0,7] and y(-) G C 

\b(t,y(.))\ + \a(t,y(.))\<K(l + \y(.)\ c y, 

2. there exists a constant L > such that for all t G [0, T] and y(-), z(-) G C 

\b(t,y(-))-b(t,z(-))\ + \a(t,y(-))-a(t,z(-))\ < L\y{-) - z(-)\ c ; 

3. for allte [0,7], b(t,-) G ^(C,^), a(t,-) G ^(C^^). 

In the following we collect some results on existence and uniqueness of a solution to equation 
(|2.2p and on its regular dependence on the initial condition. It turns out that there exists a 
continuous solution, so we can define a C-valued process X = (-X't)tg[o,T] by 

X t (9) = y t+e , 9e[-r,0]. (2.3) 

We notice that if t + 9 < then y t+ Q = x(t + 0). We will use the notations y x , yf, X x or Xf 
to indicate dependence on the starting point x G C. 
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Theorem 2.3 If Hvpothesis \2.2\ points 1 and 2, holds true then there exists a unique continuous 
adapted solution of the delay equation \2. 2\) . and moreover the process (^t)t&[o,T] belongs to 
S p ([0, T]; C) for every p > 2 and 

\\ X \\ P sp(\ot]-c) =E SU P \Vt\ P < C 

*S[-r,T] 

for some constant C > depending only on K,L,T,p. 

In addition, the map x — ► X x is Lipschitz continuous from C to S p ([0, T]; C); more precisely, 

\\X X1 -X X >\\ SP{[0>T] . C) = (e sup \y^-y^A <L sup \x x {9) - x 2 (9)\ 

\ te[-r,T] J ee[-r,o] 

for some constant L > depending only on K,L,T,p. 

If we further assume that Hypothesis \2.2\ point 3, holds true then the map x — > X x belongs 
to the space ^ 1 (C, S p ([0, T]; C)). 

Proof. For the proof (in the case of p = 2), we refer to [T7], Chapter II: we refer to Theorem 2.1 
for existence and uniqueness of the solution of equation (|2.2p . to Theorem 3.1 for the Lipschitz 
dependence of this solution on the initial datum, and to Theorem 3.2 for the differentiability of 
the solution with respect to the initial datum. See also [18], Theorems 1.1 and 1.2. The proof in 
the case of p > 2 can be performed in a similar way. □ 
Let us introduce a delay equation similar to (|2.2p but with initial condition given at time 
t€ [0,T]: 



dyY = b(r, y% ) dr + a(r, y% ) dW T , r € [t,T], 



(2.4) 



y% = x{9), 9e[-r,0]. 
We introduce the C-valued process given by 

0€[-r,0]. (2.5) 

By [Uj, Chapter III, Theorem 2.1, the C-valued process (Xr' x ) T ^ t ^ is a Markov process with 
transition semigroup, acting on bounded and Borel measurable <j) : C — * R, given by 

P tjT [tf>\{x) = E(/)(X t T ' x ), 0<t<r<T, ieC. (2.6) 

Remark 2.4 The transition semigroup {Pt, T ) has been extensively studied in the literature, see 
e.g. [17J and [18J. For the sake of completeness, we briefly recall some result on its generator, 
which will appear in section 6 in the formulation of the Kolmogorov equation. For simplicity, let 
us consider the autonomous case in equation (|2.4p : b and a do not depend on time and s = 0, 
so we consider the one parameter semigroup (Pt)t£[o,T]- The transition semigroup (Pf) is never 
strongly continuous on the space C, nevertheless it admits a weakly continuous generator £, see 
[T7j . chapter IV and [18], chapter II. Let St : C — > C denote the shift operator and let S denote 
the weak generator of the corresponding semigroup. To derive a formula for the generator C 
we need to augment C by adding an n-dimensional direction. C will be equal to the sum of 
the generator of the shift semigroup S and a second order linear partial differential operator 
along this new direction. Let F n := {vl : v £ W 1 } and C © F n := {/ + vl : f £ C, v G W 1 } 
with the norm ||/ + ulo||cffiF„ := l/|c + M- Suppose that <fi : C — > R is twice continuously 
Frechet differentiable and let / G C. Then the Frechet derivatives Vcfi(f) and V 2 ^(/) have 
unique weakly continuous linear and bilinear extensions 



V0(/) : C © F n - R, V 2 0(/) : (C © F n ) x (C © F n ) 
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We are ready to introduce C Suppose that <f> : C — > R, (j) G D(S), and (j) is sufficiently smooth 
(e.g. <j) is twice continuously differentiable and its derivatives are globally bounded and lipschitz 
continuous). Then (j) G D(C) and V/ G C 

£(0)(/)=5(^)(/)+W(7)W)lo) + ^f;vW(a(/)(eOlo,a(/)(eOlo), (2.7) 

i=l 

where {ej}™ =1 is any basis of R n . 

2.3 Differentiability in the Malliavin sense 

Our aim is now to compute the Malliavin derivative of the solution of the delay equation. We 
start by recalling some basic definitions from the Malliavin calculus. We refer the reader to the 
book [19j for a detailed exposition. 

We consider again a standard Wiener process W = (W t ) t >o in R d and the Hilbert space 
L 2 ([0,T];R d ) of Borel measurable, square summable functions on [0, T] with values in R d , with 
its natural inner product. This can be identified with the product space (L 2 ([0, T])) d or with the 
space L 2 {T), where the measure space T := [0, T] x {1, . . . , d} is endowed with the product of the 
Lebesgue measure on [0, T] and the counting measure on {1, ... , d}. Elements h G L 2 ([0, T];R d ) 
may be denoted {h^(s), s G [0,T], j = 1, . . . ,d} or {hP}, where h? G L 2 ([0, T]). 

For every h G L 2 ([0,T];R d ) we denote 



W(h) = 




W is an isometry of L 2 ([0, T}; R d ) onto a gaussian subspace of L 2 (ft), called the first Wiener 
chaos. Given a Hilbert space K, let Sk be the set of K- valued random variables F of the form 

m 

F = Y J fr{W{h l ),...,W(h n ))k r , 
r=l 

where hi, . . . ,h n G L 2 ([0,T];R d ), {k r } is a basis of K and fi, ■ ■ ■ f m are infinitely differentiable 
functions R n — > M bounded together with all their derivatives. The Malliavin derivative DF of 
F G 5^ is defined as the process {D J S F; s G [0, T], j G {1, . . . , d}} given by 

m n 

D{F = J2Y, dkMWfa), W(h n )) h{(s) k r , 

r=l k=l 

with values in K; by we denote the partial derivatives with respect to the A:-th variable. It is 
known that the operator D : S K C L 2 {Q,; K) -> L 2 (fi x [0, T] x {1, . . . , d}; K) = L 2 (Q, x T ; if) 
is closable. We denote by D 1 ' 2 (JC) the domain of its closure, endowed with the graph norm, and 
we use the same letter to denote D and its closure: 

D : B 1,2 (K) C L 2 (n;K) -> L 2 {VL x T;K). 

The adjoint operator of D, 

5 : dom (5) C L 2 (Q xT;K) -> L 2 {Sl; K), 
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is called Skorohod integral. For a process u = {ui; s £ [0, T], j £ {1, . . . , d}} £ dom(<5) we will 
also use the notations 

/■T d i-T 

b(u) = u s dW s = V / v? s dW 3 s . 
Jo j=1 Jo 

It is known that dom(<5) contains every (./^-predictable process in L 2 (f2 x T;K) and for such 
processes the Skorohod integral coincides with the Ito integral; dom(<5) also contains the class 
L 1 ' 2 (K), the latter being defined as the space of processes u £ L 2 (Q x T;K) such that u\ £ 
J} 1,2 (K) for a.e. t £ [0, T] and every j, and there exists a measurable version of F>\u\ satisfying 



I II 2 - II II 2 

\ U \\V-*{K) — \\ U \\L^(QxT;K) 



- i-T i-T 

+ EV / / \D x s u\f K dtds < oo. 
iJ=1 Jo Jo 



Moreover, 1 1^(^)11^2^.^) < II^IIl 1 . 2 ^) - ^ e no * e ^ e s P ace ^ 1,2 (K) is isometrically isomor- 
phic to L 2 (T;n^ 2 (K)). 

Finally we recall that if F £ 1,2 (K) is measurable with respect to Tt then D 3 F = a.s. on 
Q x (t, T] for every j. 

If K = M or K = R n , we write B 1 - 2 and L 1 ' 2 instead of 0^ 2 (K) and h^ 2 (K) respectively. 

We now introduce the Malliavin derivative for a functional of a stochastic process. In the 
remainder of this section we set E = C([— r, T]; M n ). If / £ Q 1 (E,M n ) then, according to the 
notation introduced above, 

(Vf(x),g) E *, E = [ g(e)-Vf(x)(de), x,g£E. 

If y is a continuous stochastic process with time parameter [— r, T] then f{y) is a random 
variable. We wish to state a chain rule for the Malliavin derivative of f(y.)- We will restrict to 
the case when y is adapted, more precisely its restriction to [0,T] is adapted to (^Ft)te[o,T] an d 
its restriction to [— r, 0] is deterministic. Clearly, Dyt = for t £ [— r, 0]. Following p3|, lemma 
2.6, we have the following basic result (we note that in [Hj derivatives are understood in the 
sense of Frechet, but the same arguments apply to the present situation). 

Lemma 2.5 For E = C([— r, T]; W 1 ), let f £ Q 1 (E,'K) be a Lipschitz continuous function. 
Assume that y = (yt)te[-r,T] ^ s a process in W 1 satisfying the following conditions: 

1. y is a continuous adapted process and E sup tg [__ r r ] \yt\ 2 < oo; 

2. y £ L 2 ([— r, T], D 1 ' 2 ) and the process {D s yt,0 < s < t < T} admits a version such that, 
for every s £ [0, T], {D s y t ,t £ [s,T]} is a continuous process and 

E / sup \D s y t \ 2 ds < oo. 

te[s,T] 

Then f(y.) £ B 1 ' 2 and its Malliavin derivative is given by the formula: for j = 1, ...,d and a.e. 
s £ [0,T] we have, P-a.s., 

D{{f{y.)) = (Vf(y.),Diy.) E *,E = [ D{y e • Vf(y.)(d8). (2.8) 

J[-r,T] 

Next we establish when the solution of the delay equation is Malliavin differentiable, and 
moreover we write a stochastic (functional) differential equation satisfied by the Malliavin deriva- 
tive. We substantially follow [14J, Theorem 4.1. 
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Theorem 2.6 Let Hypothesis 1 2. 2\ be satisfied. Then the solution (yt)te[-r,T] satisfies conditions 
1. and 2. in Lemma \2.5[ Moreover yt G B 1 ' 2 for every t G [0, T] and the following equation 
holds: for j = 1, ...,d and every s G [0, T] we have, F-a.s., 



Dim 



ft 



+ 

, Dim = o, 

Finally, for every p G [2, oo) and s G [0, T] we have 



r,0] 

t G [-r, s) 



Diy t+e -V x b{t,y t+ .)(d6) dt 

te[s,T], 



[-no] 

Diy t+g -V x a(t,y t+ .)(d0) dW t , 



(2.9) 



E 



r 



sup |D s yf| p (is < oo. 

o te[s,T] 



(2.10) 



Proof. Except for the final statement, the proof can be achieved with techniques similar to the 
ones indicated in the proof of Theorem 4. 1 in p3] . The only minor difference is that we consider 
a general delay differential equation, while in |14j the coefficients depend on the past behavior 
of the solution only after time 0: however, the same arguments apply. 

The proof of the final statement follows by standard estimates on equation (|2.10p . taking 
into account that V x b and V x cr are bounded in the total variation norm. □ 



Corollary 2.7 Suppose that the assumptions of Theorem \2. 6\ hold true, let C = C([— r, 0];R n ) 
and (X t ) t £[o,T] be the C-valued process defined by $2.3\) . Suppose that f G ^ 1 (C;R) satisfies 

|V/(x)| <C(l + \x\ c ) m , xeC, 

for some C > and m > 0. 

Then for every t G [0, T], f(Xt) = f{yt+-) belongs to D 1,2 and for j = 1, ...,d we have, for 
a.e. s G [0,T], F-a.s., 

Di(f(X t )) = (Vf(X t ),Diut+.)c,C = [ Diy t+e ■ Vf(X t )(d6). (2.11) 

Proof. The conclusion follows immediately from Lemma [2 .51 and Theorem 12.61 if / is a Lipschitz 
function. The general case can be proved by approximating / by a sequence of Lipschitz functions 
obtained by a standard truncation procedure. □ 



Remark 2.8 The first result on Malliavin differentiability of the solution of a functional stochas- 
tic differential equations was proved in [15]. In that paper the aim was to prove that yt belongs 
to the domain of the generator of the Ornstein-Uhlenbeck semigroup of the Malliavin calculus, 
therefore more restrictive assumptions were assumed on the coefficients of equation (|2.2p . in 
particular they were required to be twice differentiable. 



3 A result on joint quadratic variations 

The aim of this section is to state and prove a technical result, Theorem 13. 11 which will be used 
in the rest of this paper. To state this theorem we need to recall some definitions concerning joint 
quadratic variations of stochastic processes and to introduce a differential operator, denoted Vo, 
which will also play a basic role in the sequel. 
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We say that a pair of real stochastic processes (X t ,Y t ), t > 0, admits a joint quadratic 
variation on the interval [0, T] if setting 

1 f T 

C( QjT] (X,Y) = -J (X t+e -X t )(Y t+e -Y t )dt, e>0, 

the limit lim e _»o t] (X, Y) exists in probability. The limit will be denoted {X, K)[o,t]- 

This definition is taken from [26J, except that we do not require that the convergence in 
probability holds uniformly with respect to time. In |26j the process {X, Y) is called generalized 
covariation process; several properties are investigated in [27], [28], often in connection with the 
stochastic calculus introduced in [25J. With respect to the classical definition, the present one 
has some technical advantages that are useful when dealing with convergence issues (compare 
for instance the proof of Theorem 13.11 below) . 

In the following we will consider joint quadratic variations over different intervals, which is 
defined by obvious modifications. 

It is easy to show that if X has paths with finite variation and Y has continuous paths then 
(X,Y) [OiT] =0. 

If X and Y are stochastic integrals with respect to the Wiener process then the joint quadratic 
variation as defined above coincides with the classical one. A similar conclusion holds for general 
semimartingales: see [26], Proposition 1.1. 

We set C = C([-r, 0]; R n ) and, for every t G [0, T] and x G C, we let {Xt' x , s G [t, T]} denote 
the process defined by the equality (|2.5j) , obtained as a solution to equation (|2.4j) . In particular 
it is an C-valued process with continuous paths and adapted to the filtration {J~[t :S ], s G [t, T]}. 
xI' x (lo) is measurable in (u,s,t,x). 

Let u : [0, T] x C —* R be a function such that u(t, •) is Gateaux differentiable on C for 
every t G [0, T]. Then the gradient V x u(t,x) at point (t,x) G [0, T] x C is an n-tuple of finite 
Borel measures on [— r, 0]; we denote by \'V x u(t,x)\ its total variation norm and we denote 
Vou(t,x) = V x u(t,x)({0}), compare (|2 . 1 [) ; thus, Vou(t, x) is a vector in R n whose components 
Vqu(£, x) (k = 1, . . . , n) are the masses at point of the components of V x u(t, x). 

We denote W l [i = 1, ... ,d) the i-th component of the Wiener process W, by a' 1 the i-th 
column of the n x d matrix <r, and by a\ (k = 1, . . . ,n) its components. 

Theorem 3.1 Assume that u : [0,T] x C — > R zs a Borel measurable function such that u(t, •) G 
^(C^R) /or every t G [0,T] and 

|u(t,x)| + |V x u(t,x)| < C(l + |x|) m , (3.1) 

/or some C > 0, m > and /or every t G [0, T], x £ E. 

Then for every x G C, i = 1, ... ,d and < t < T' < T the processes {u(s, Xt ,x ), s G [i,? 1 ]} 
and VF* admit a joint quadratic variation on the interval [t,T'], given by the formula: 

(u^X^lW*)^ = / j&X?) ■ VouCs.Jtf*) d« = E / 4is,X^) ■ V k u(s,Xi> x ) ds. 
Jt k=1 Jt 

Proof. For the sake of simplicity we write the proof in the case t = 0, the general case being 
deduced by the same arguments. 

We fix x G C, T G (0,T) and we denote X°> x by X for simplicity. Thus, X t = y(t + •), 
t G [0, T], satisfies 

dy{t) = b(t, X t ) dt + a(t, X t ) dW t , X = x. 
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We will use the results on the Malliavin derivatives stated in Theorem 12.61 an d m particular 
formula (|2.9p that, in view of (|2.1ip . can be written in the form: 

D s y{t) =a(s,X s ) + ! D s [b(r,X r )] dr + I D s [a(r,X r )} dW r , (3.2) 

J s J s 

for < t < s < T. Noting that V x b(t,x) and V x a(t,x) are bounded by the Lipschitz constant 
L of b(t, •) and a(t, •), it follows from (|2.1ip that for every r € [0, T] 



\\D.[b(r,X r )}\\ 2 <L 2 [ sup \D s y(t)\ 2 ds, \\D.[a(r, X r )}\\ 2 < L 2 [ sup \D s y(t)\ 2 ds, 
Jo te[s,T] Jo te[s,T\ 

(3-3) 

where || • || denotes the norm in L 2 ([0, T];] 
We have to prove that 



1 



C':=C{ QT , ] (u(;X.),W i ) = - / {u(t + e,X t+e )-u(t,X t ))(Wt +e -Wl)dt 
, e Jo 

-» / a l (t,X t )-V u(t,X t )dt 
Jo 

in probability, as e — > 0. 

We need to re-write C € in an appropriate way, fixing e > so small that T' + e < T. We 
first explain our argument by writing down some informal passages: by the rules of Malliavin 
calculus we have, for a.a. t £ [0, T'], 



(u(t + e, X t+e ) - u(t, X t )){Wt +e - Wl) = (u(t + e, X t+e ) - u(t, X t ))e* dW t 



pt+e rt+e 

/ Di(u{t + e, X t+e ) - u{t, X t )) ds + / (u(t + e, X t+e ) - u(t, X t ))e*dW s , 
Jt Jt 



(3.4) 



where the symbol dW denotes the Skorohod integral, and by we denote the i-th component 
of the canonical basis of M. d and by e* its transpose (row) vector. Integrating over [0, T'] with 
respect to t and interchanging integrals gives 

rT' rt+e 

eC e = / Di(u(t + e,X t+e )-u(t,X t ))dsdt 

° ,-xKe psAT' (3-5) 

+ / / (u(t + e, X t+t ) - u(t, X t )) dt e* dW s . 

JO J(s-e) + 

To justify (13. 4p and (13. 5|) rigorously we proceed as follows. To shorten notation we define 
v t = (u(t + e, X t+e ) - u(t, X t )) l [0jT ,] (t), t e [0, T], 

A e = {{t, s) G [0, T] x [0, T] : 0<t<T',t<s<t + e}. 

Using Corollary 12.71 and formula (|2,10p . it is easy to show that, for all t, v t belongs to ID 1 ' 2 and 
the process v t U^(i, •) belongs to L 2 {Q, x [0, T\). By [20] Theorem 3.2 (see also pjjj Section 1.3.1 
(4)) we conclude that vt 1a e (^ •) e* is Skorohod integrable and the formula 

I v t 1 A * (t, s) e* dW s = v t [ 1a« (t, s) e* dW s - [ D\v t 1 A « (t, s) ds =: z t , (3.6) 
jo Jo Jo 
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holds provided z t belongs to L 2 (f2). Since J Q T lA e (t, s) dW s coincides with the Ito integral 

Jq lyi e (i) s) dW s = (Wt+e — Wt)l\tT'](t) > it i s i n f ac t eas Y to verify that we even have z G 
L 2 (Q x [0,T]); thus <^M) holds for a.a. t, and ([Ml) yields ([331) for a.a. t G [0,T']. 

Next we wish to show that the process Jq Vtl A e (t, •) efte^ is Skorohod integrable and to 
compute its integral, which occurs in the right-hand side of f|3.5[) . For arbitrary G G B 1 ' 2 , by 
the definition of the Skorohod integral and by (|3.6|) , 

e/ // v t l A e(t,s)dtei,D s G) ds = I l/ («tU«(t, a) D s G) R d eft 

io WO / Rd JO JO 



G / utU«(* s a) e* dW s 



E 



E 



G / 2* eft 



This shows, by definition, that Vtl A e {t, ■) efte^ is Skorohod integrable and 



1^1^ (t, s) dt e* dW s = zt dt 



'0 Jo 

Recalling f|3.6[) we obtain 



T r T 



^0 



vtl^(i, s) e* dW s dt. 



o Jo 



v t l A * (t, s) dt e* dW s = / v t {W l t+e - Wl) l [m (t) dt 



T ,-T 



JO 



D\vt &i 1a c (t, s) ds dt, 



and ()3.5p is proved. 

Recalling that D s (u(t, Xt)) = for s > t by adaptedness, and using the chain rule (12. lip for 
the Malliavin derivative we have, for a.a. s, t with s G [t, t + e], 

D s (u(t + e,X t+e )-u(t,X t )) = D s (u(t + e,X t+t )) = [ D s y(t + e + 0) -V x u(t + e,X t+e )(d0) 

J\-r,0] 



and from (13.51) we deduce 



C e 



T" /•<+ 





T'+e psAT' 



( s -e)H 



-r,0] 



L>*y(t + e + 0) ■ Va;u(£ + e, X t+e )(e£0) eis eft 



(u(t + e, - u(t, X t )) dt e* dW s 



Now we let e — > 0, and we first claim that J| ~ ^ in probability. To prove this, it is enough 
to show that the process ^ J^(u(t + e, X t + e ) — u(t, X t )) lA e (t, •) dt converges to in L 1,2 . Indeed, 
since the Skorohod integral is a bounded linear operator from L 1,2 to L 2 (f2), this implies that 



o e Jo 



(u(t + e, X t+e ) - u(i, X t ))l A . (t, s) dt e* dW s 



in L (f2). We prove, more generally, that for an arbitrary element y G L '"(M), if we set 
1 /- T 1 /- sAT ' 

= - / (yt+e - yt) IahM) dt = - (y t+€ - yt) dt, s g [o,t], 

e JO e J(s~e)Vt 
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then the process T e (y) converges to in L 1,2 (M). Let us recall that L 1,2 (M) is isomorphic to 
^([O^JiD 1 ' 2 ^)). It is clear that T e {y) -» in L 1 > 2 (R) if y belongs to C([0, T]; D 1 > 2 (M)), a 
dense subspace of L 2 ([0, T]; B 1>2 (R)). So to prove the claim it is enough to show that the norm 
of T e , as an operator on L 1,2 (R), is bounded uniformly with respect to e. We have 

\T e (y) s \li,2 {m) < ^ J l A e(t,s)dtJ^ \y t+€ -yt\li,2 iR) lA^t,s) dt 



< 



i A 



r(y)lL 2( 



l2ft+e - 2/tloi.2(R) dt, 

T 



\T e (y)sg^ ( 



ds 



o 



< 

< 
< 



-/ \yt+e -ytlliam [ iA<{t,s)ds 
6 Jo Jo 



dt 



r' 



\yt+e-yt\lx,2 (M ) dt 



This shows the required bound, and completes the proof that I\ — > as e — > 0. 

Now we proceed to compute the limit of If. We note that, by adaptedness, D s y(t + e- 
for s > i + e + 9, so that 

rT" /-t4 



1 



Jf 



s-t-e,0] 



D\y{t + e + (9) • V a u(t + e, X t+e ){d6) ds dt. 



For fixed t, let us exchange integrals with respect to ds and V x u(t + e,Xi +e )((i#) obtaining 

1 f 7 " f pt+e+0 
if = - / / / D\y(t + e + 9)ds- V x u(t + e, X t+£ )(d0) ttt. 
e Jo J[-e,0] Jt 

Next we replace D s y(t + e + 9) by the expression given by (13. 2ft and we obtain 

t+e+6 



e Jo 



-e,Q] Jt 



1 

+ - 

6 Jo J[-e,0] Jt 
e Jo J[-e,0] Jt 



a\s,X s ) ds ■ V x u(t + e,X t+e )(d9) dt 
t+e+e ft+t+e 

Dl[b(r, X r )] dr ds ■ V x u(t + e, X t+t )(d9) dt 



D\[a{r,X r )] dW r ds ■ V x u(t + e,X t+e )(d9) dt 



t+e+6 rt+e+ 



— : J\ + ,J\ + J 3 . 
We first show that JJ ^ in L^fi). Since, by ([3T 



|V^(i + e,X t+e )| <C(1+ sup |X t | c r, 

te[o,T] 

then, using the notation |V x n(t + e, X t+e )\(d9) to indicate the total variation measure, we have 



l J 3 £ l ^ 7 , . 

e Jo J[-e,0] Jt 



t+e+6 



t+e+6 



Di[a(r,X r )] dW r 



ds \V x u(t + e,X t+e )\(d9) dt, 



1 fT' /-t+e+6 

< C(l + sup |X t | c ) m - / sup / 
te[o,T] e Jo ee[-£,o] Jt 



t+e+6 



<C(1+ sup |X t | c ) 

te[o,T] 



T' rt+e 



sup 

e Jo Jt 6e[-e,0] 



t+eH 



D*[<7(r,JT r )] dW r ds dt, 



D l s [a(r,X r )} dW r ds dt. 
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Taking the L x (0) norm of both sides and using the Holder and the Doob maximal inequality 
we have 



J: 



s\\v-{n) 



1 



<C||(1+ sup \X t \ c ) m \\ L * m - 
te[o,T] e 



pT' nt+e 




rt+e+6 




sup 


/ Di[a(r,X r )]dW r 


Jo Jt 


0€{-e,O] 


J s 



ds dt 



< 



C 



e 

C 
e 



T pt+e 



t+e 



L 2 (U) 



ds dt 



1/2 



D l s [a{r,X r )\ dW r 

Jt Js 

fT > f -t+e / ft+e \ -V" 

J J (j E\Dl[a(r,X r )}\ 2 dr) ds 

i 1 2 

Denoting for simplicity h(s,r) = E \D\[a{r, X r )]\ we obtain 



dt. 



\J' 



311^(0) 



C 

< — 



T> rt+e 
Jt 



t+e \ 1/2 

h(s, r) dr I ds dt 



c f T ' ( r t+e f t+t \ 

' h(s, r) dr ds ) dt 



Ve Jo 



T 



t Jt 
1 

e 



t+e rt+e 



h(s, r) dr ds 



1/2 



dt 



Let us note that h G L 1 ([0, T] 2 ), since by (|3.3p we have 



I f h(s,r) dr ds <E / ||D. [<r(r, X r )\ fdr < L 2 T [ E sup \D s y(t)\ 2 ds 
Jo Jo Jo Jo te[s,T] 



< oo. 



Let us define the operator A e : L l ([Q,T] 2 ) L 1 ([0, T]) by 



(A t k){t) = ~ 



At+e)AT At+e)AT 

I I k{s,r)drds, fca^O.T] 2 ). 

Jt Jt 



Then we have H^Hl 1 ^) < CV^'ll-^-e^H^wjo t])' so ^° P rove that J| — > in L 1 (J7) it is enough 
to show that A e k — > in L 1 ([0,T]) for every k G L 1 ([0, T] 2 ). This is obvious if A: is in the space 
of bounded functions on [0, T] 2 , a dense subspace of L 1 ([0,T] 2 ). So it is enough to show that 
ll-A^lli^Qo.T]) ^ C 1 1 ^ 1 1 z, 1 ([o,T] 2 ) f° r some constant C and for every k G L 1 ([0,T] 2 ). This follows 
from the inequalities 



\A e k(t)\ dt < 



T ,-T 



T r T i-T 



JO 



JO 



\k(s,r)\ 



l^( s ) r )l 1 t<s<(t+e)AT 1 t<r<(t+e)AT^ d $ dt 

r-T f-T 



^(s-e)+ <t<s^(r~e)+ <t<sdt 



ds dr < 



o Jo 



\k(s, r)\ ds dr, 



since the term in square brackets is less or equal to e. 

This finishes the proof that J| — > in L 1 (0), hence in probability. In a similar and simpler 
way one proves that J| — > in probability. 

To finish the proof of the proposition it remains to compute the limit of J{. Exchanging 
integrals with respect to ds and V x u(t + e, Xt +t ){d9), and then using another change of variable 
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we have 



i fT' r rt+e+9 

J{ = - / / v l (s, X s ) ds ■ V x u{t + e, X t+e ){d6) dt 

e Jo J[-e,0] Jt 

= - f [ [ a i {s,X s )-V x u{t + e,X t+e ){de) dsdt 

e Jo Jt J[s-t-e,o] 

= -/ / a l (s,X s )-V x u(t + e,X t+e )([s-t-e,0])dsdt 

e Jo Jt 

1 f T+e ft ■ 
= -/ / a l (s,X s )- V x u(t,X t )([s-t,0}) dsdt 

e Je Jt-e 
1 fT'+e ft . 

= -/ / a\t,X t )-V x u(t,X t )([s-t,0})dsdt 

e J € Jt-e 

+- I I W(s,X s )-a l (t,X t )}-V x u(t,X t )([s-t,0})dsdt 

e Jt Jt-e 



=: H{ + h£ € 

Next we show that if| ~* 0> P-a.s. Since 

\V x u(t + e,X t+e )\<C(l+ sup \X t \ c ) m , 

te[o,T] 

we have 

< C(l+ sup |X t | c ) m - T K(s,X s )-^,X t )|(i S( ft 

t6[0,T] 6 Jt-e 

< C(l+ sup |X t | c ) m / -f \a\s,X s )-a i {t,X t )\dsdt. 

te[0,T] Jo e J{t-e)+ 

Let us fix w G $7 and note that, P-a.s., a l (-,X.) G L 1 ([0, T]). Let us define the operator 
^:L 1 ([0,T])^L 1 ([0,T]) as 

(B e fc)(t) = - / \k(s) - k(t)\ ds, k G L\[0, T]). 

e J(t-e)+ 

Then we have \H%\ < C(l + sup te[0iT ] |X t |c) m ||H e cj'(-, X)|| l i([ 0j t]), P-a.s., so to prove that H\ -> 
in probability it is enough to show that -> in L 1 ([0,T\) for every G L 1 ([0,T]). This 
is obvious if A; is in the space of continuous functions on [0, T], a dense subspace of L 1 ([0, T]). 
So it is enough to show that ||# e A;||£,i([o,T]) < C||^IIl 1 ([o,t]) f° r some constant C and for every 
A; G L 1 ([0,T]). This follows from the inequality 



\(B £ k)(t)\<- f \k(s)\ds + \k(t)\, 

e J(t-e)+ 



which implies 



/ |(B e A;)(t)| dt < \\k\\ Ll{[0>T]) + - f I \k{s)\ ds dt 

Jo e Jo J(t- e )+ 

I fT f(s+e)AT 

= \\k\\m[o,T]) + - / / \k{s)\ dtds 

e Jp Js 

< \\ k hi({o,T]) + / |*(«)| ds = 2||fc|| L i ([0iT]) . 
Jo 
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This finishes the proof that H\ — > P-a.s., hence in probability. 
It remains to consider the term 



i r T+t ■ r* 

H[ = - a % (t,X t )- V x u{t,X t )([s-t,0})dsdt 

e Je Jt-e 



1 



t-e 

T'+e r i 



a\t,X t )- / V x u(t,X t )(d9) ds dt 

Jt-e J[s-t,0] 
T'+e r rt+e 



i rl +e r rt+0 

- a'i^Xt)- / dsV x u(t,X t )(de)dt 

e Je J[~e,0] Jt-e 

-I a%X t )- I (6 + e)V x u(t,X t )(d6)dt 

e Je J[-e,0] 

[ * a\t, X t ) ■ [ + V x u{t, Xt){dO) dt. 

Je J\-rM \ e J 



We clearly have, P-a.s. 



/ (l + -) + V x u(t,X t )(d6) -» / l {oy (6)V xU (t,X t )(d6) 

J[-r,0] \ e J J[-r,0] 

= V x u(t,X t )({0}) = V u(t,X t ), 

and by dominated convergence, P-a.s., 

H{ —* / a\t,X t )-V u(t,X t )dt. 
Jo 

This shows that C e converges in probability and its limit is 

r T ' ■ 

(u(;X.),W l ) [0>Tf] = / a l (t,X t ) ■ V u(t,X t ) dt. □ 
Jo 

4 The forward-backward system with delay 

In this section we will discuss existence, uniqueness and regular dependence on the initial data 
of the following forward-backward system: for given t G [0, T] and x G C = C([— r, 0]; R n ), 

dy T = 6( T) X t ) dr + <r(r, X T ) dW r , r G [i, T] C [0, T), 
X t = x, 

dY T = ^(t, X T ,Y T , Z T ) dr + Z T dW T , { ' ' 

Y T = <j>(X T ), 

Here we use the notation X T {9) = y T +e-, 6 G [— r, 0], as before, so the first equation in (|4.ip is 
the same as (|2.4p . We extend the definition of X setting X s = x for < s < t. The second 
equation in (|4.ip . namely 

dY T = ^(r, X T , y T , Z T ) dr + Z T d^ r , r G [0, T], 

y T = ^(x r ), 

is of backward type. Under suitable assumptions on the coefficients ift : [OjTjxCxlxl^l 
and </> : C — > R we will look for a solution consisting of a pair of predictable processes, taking 
values in R x R rf , such that Y has continuous paths and 

rT 

||(Y,Z)||i cont :=E sup |Y T | 2 +E/ \Z t \ 2 dr < oo, 

re[0,T] JO 
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see e.g. [23J. In the following we denote by K cont ([0,T]) the space of such processes. 

The solution of (|4.ip will be denoted by (X T , Y T , -^ T )re[o,T]! or > to stress the dependence on 
the initial time t and on the initial datum x, by (Xr' x , Yr' x , Z*' 1 ) rg [ >m . 

Hypothesis 4.1 The maps ^[O^jxCxKxl^R and <f) : C — > R are Borel measurable 
and satisfy the following assumptions: 

1. there exists L > such that 

\ip(t,x,y,zi) -^(t,x,y,z 2 )\ < L\z\ - z 2 \ , 
\ip(t,x,y!,z) - ip (t,x,y 2 ,z)\ < L\yi - y 2 \ , 



for every t G [0, T], x G C, y, y%, y 2 G R and z, z\,z 2 G R ; 

2. ip(t, ■,-,■) eG 1 (CxRx M. d , R) /or every t G [0,T]; 

3. i/tere exisi -fC > and m > such that 

|V X V (t, x,y, z)\ <K(1 + \x\ c + \y\) m (1 + \z\) 
for every t G [0, T\, x G C, y G R and z G R d ; 
^. G C/ 1 (C,R) and there exist K > and m > suc/t that 

\V(f>(x)\ < K (l + \x\ c ) m , xeC. 

Under these assumptions we can state a result on existence and uniqueness of a solution of 
the forward- backward system (14. lj) and on its regular dependence on x. 

Proposition 4.2 Assume that Hypotheses \2.2\ and \4-l\ hold true. Then the forward-backward 
system 1^4 -1\ > admits a unique solution (X*> x ', Y t,x , Z l < x ) G S p ([0, T];C) x K con t ([0, T]) /or ev- 
ert/ (t,x) G [0,T] x C. Moreover, the map (t,x) i— > {X t)X , Y t,x , Z t,x ) belongs to the space 
Q 1 ([0, T] x C,5 P ([0, T}; C) x IC con t ([0, T])). Finally, the following estimate holds true: for every 
p > 2 there exists C > suc/i that 

E sup IVa-l^ 

i-e[o,T] 

Proof. We only give a sketch of the proof. The forward equation has a unique solution by 
Theorem 12.31 Existence and uniqueness of the solution of the backward equation follows from 
the classical result [23j . 

In Theorem 12.31 we have shown that the map x t— > X t,x belongs to C 1 (C,S P ([0, T] ; C)) for 
every 2 < p < oo. Then the proof of continuity and differentiability of (t, x) ^ (X ' x , Y l ' x , Z t,x ) 
in the appropriate norms, as well as the final estimate on V x Yr' x , can be achieved as in Propo- 
sition 5.2 in [TO]. The only difference is that in pjj] the process X t,x takes values in a Hilbert 
space, while in our context it takes values in the Banach space C; nevertheless the same argu- 
ments apply (see also |16j for a similar result in Banach spaces). □ 



x/p 



<C[l + \x 



(m+l)' 



t G [0, Tl, x G C. 
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Corollary 4.3 Assume that Hypotheses \2.2\ and \4-l\ hold true. Then the function v : [0, T] x 
C — > R defined by 

v(t,x) = Y t f ' x , t e [0,T],x e C, (4.3) 
belongs to Q 0,1 ([0, T] x C;M). Moreover there exists C > such that 

\V x v(t,x)\ < c(l + \x\ ( ™ +1)2 ) , t e [0,T],xe C. 

Finally, for every t £ [0, T] and i £ C, w /iaue, P-a.s, 

1?* = u (s, jtf*) , for every s € [i, T] , (4.4) 

Z*' x ' = V v (s, Xl' x ) a(s, Xl' x ), for a.e. s G [t, T]. (4.5) 

Proof. It is well known that v(t,x) is deterministic, and its properties are therefore a direct 
consequence of Proposition 14.21 Equality (|4.4p is also a standard consequence of uniqueness of 
the solution of the backward equation. 

To prove (|4.5p we consider the joint quadratic variation of Y t,x and the Wiener process W l 
on an interval [t, T'], with T' < T. Taking into account the backward equation we obtain 

(¥'•*, W% tTI] = ^ Zlds. 

By Theorem 13. II we have 

(v {;X^),W% >TI] = ^ a\s,XY) ■ Vov(s,Xl>*) ds, 
so that (|4T4]) implies (|43|) . □ 

Remark 4.4 If we strengthen slightly the regularity assumptions and we require that, for all 
t £ [0, T], the functions b(t, -),o-(t, •), 4> are continuously Frechet differentiable on C and tp(t, •, •, •) 
is continuously Frechet differentiable oeCxIx M. d , then we can prove, with only minor changes 
in the proofs, that the function v defined in (|4.3p is Frechet differentiable with respect to x and 
the Frechet derivative is a continuous function from [0, T] x C to the dual space C* with respect 
to the usual norm (i.e. the variation norm). 

Remark 4.5 In the context of Proposition 14.21 the law of the solution (X t)X , Y t,x , Z t,x ) is 
uniquely determined by ,x and the coefficients b,a,ip,(j). Since v(t, x) is deterministic, hence 
determined by its law, we conclude that the function v is a functional of the coefficients b, a, ip, <j) 
and does not depend on the particular choice of the probability space (fi, J 7 , P) nor on the Wiener 
process W. 



5 Application to stochastic optimal control 

5.1 Strong formulation of the optimal control problem 

Let (CI, J 7 , (^Ft)t>o >P) be a filtered probability space, satisfying the usual conditions, and let W 
be an W 1 - valued standard Wiener process with respect to (J-t) and P. We consider the following 
controlled functional stochastic equation on an interval [t,T] C [0,T]: 

/ dy^ = b(s,y% + .) ds + a(s,y^ + .) [h(s,y^ + .,u s ) ds + dW s ], . , 

\ y? +e = x(e), 9 G [— r, 0], { °- 1 
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The coefficients b and a satisfy the previous assumptions, u(-) denotes the control and y u the 
corresponding solution. We assume that controls are (J-i)-predictable process with values in a 
given measurable space (U,U). The function h : [0, T] x C x U — > M d is measurable and bounded. 
We introduce again the process 

X s " = y u s+ . = {y« +e , G [-r,0]}, « G [i,T], (5.2) 

which now depends on the control and takes values in C = C([— r, 0]; M n ), so that equation (|5.ip 
can be rewritten as 

d^ = 6( S) X«)d S + a( Sj X«)[/ l ( S ,X?,u,))cfa+ dWj, sG[i,T], 
Xt = x. 

We introduce the cost functional to minimize: 

J{t,x,u(-))=E I g{u s )ds + E(f>(y% + .)=E I g{u s ) ds + E<f>(X%), (5.4) 
Jt Jt 

where g : U — ► [0, oo) and : C — ► R are given functions. 

Remark 5.1 Without any substantial change, we could consider more general cost functionals 
of the form 

J(t, x, u(-)) = E / [«) + ds + E<j>(y% + .), (5.5) 



where £ : R n — > R. In fact, this kind of cost can be put in the form (|5.4p as follows: first note 
that in equation (15. ip we can assume r > T, possibly extending the functions b and cr in the 
obvious way; next we define, for x G C, 

so that (fro (Xj,) = £o(Us) ds and we conclude that 

J(t, x, «(■)) = E ^ </(u«) + E[(<£o + 

which has the required form. In a similar way, under suitable assumptions, one could consider 
even more general costs of the form 

J(t, x, u(0) = E £(s, y u s , u s ) ds + E 
However, we limit ourselves to cost functionals with the structure of (15.41). 



To proceed further we need to introduce the hamiltonian function ip : [0, T] x C x R d — ► R 
defined, for t G [0, T], x G C, z G R d , by the formula 

(t, x, z) = inf {g(u) + zh (t, x,u) : u £ U} (5-6) 

and the corresponding, possibly empty, set of minimizers 

T (t, x, z) = {u G U, g{u) + zh (t, x,u) = ifr (t, x, z)} . (5-7) 
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Remark 5.2 By the Filippov Theorem, see e.g. [2], Theorem 8.2.10, p. 316, if U is a complete 
metric space equipped with its Borel cr-algebra, g is measurable, h is measurable bounded, with 
u i— > hit, x, u) continuous on U, and if T takes non-empty values, then T admits a measurable 
selection, i.e. there exists a Borel measurable map Fq : [0, T] x C x R d — > [/ such that To (i , x, z) G 
r(i,x,z) for t G [0,T], xe E, z£R d . 

We are now ready to formulate the assumptions we need . 

Hypothesis 5.3 1. [U,U) is a measurable space, g : U — > [0, oo) is measurable, h : [0,T] x 
C x (/-> R rf is measurable and bounded; 

2. the hamiltonian if) defined in \5. 6\) satisfies the requirements of points 2 and 3 of Hypothesis 

B 

3. the function 4> ■ C — > R satisfies the requirements of point 4 in Hypothesis \4-l[ namely it 
belongs to Q l (C,M) and there exist K > and m > suc/i i/iarf 

|V0(x)| <K(l + |a;| c ) m , ieC. 

Remark 5.4 1. Hypothesis 15.31 is stronger than Hypothesis 14. 1[ Indeed, point 1 of Hypoth- 
esis H?T] is a straightforward consequence of the fact that h is assumed to be bounded. 

2. In the case U C M. k , h(t, x, u) = u, the previous assumptions require in particular that the 
set U where control processes take values should be bounded. 

3. The assumptions on the hamiltonian function tp can be easily verified in specific cases. 
For instance if U is a closed ball of M. k centered at the origin, and g(u) = go(\u\ p ) for 
some p > 1 and some convex function go : [0, oo) — > [0, oo) such that g G C 1 ([0,oo)) and 
g' (0) > 0, then the hamiltonian is differentiable with respect to z and ip satisfies points 2 
and 3 of Hypothesis 14.11 

Now let us consider a probability space (Q,^ 7 , P), a standard Wiener process W in R d , and 
the following forward-backward system: 

' dy T = 6(t, X t ) dr + <r(r, X T ) dW Tj r G [i, T] C [0, T], 

< ^ = X ' , ^ ~ (5-8) 

dY T = ${X T , Z T ) dr + Z T dW T , y ' 

, Y T = <t>{X T ). 

By Remark 14.51 the function v : [0, T] x C — > R defined by the equality 

v (t, x) = y/' x (5.9) 

is a functional of the coefficients b,a,tl>,4> and does not depend on the particular choice of 
(f2,JT, P) nor on the Wiener process W. 

In the following proposition we show that the function v, defined in this way by means of 
an appropriate forward-backward stochastic differential system, plays a basic role in the control 
problem. 

Proposition 5.5 Assume that Hypotheses \2.2\ and \5.3\ hold true, and that the cost functional 
is given in |5.^[ ). Let v be defined in t5.9\) . Then for every t G [0, T] and x G C and for every 
admissible control u(-) we have J (t, x,u(-)) > v(t,x). 
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Proof. We fix t,x and a control u(-). Let (-^") T e[*,T] ^ e the corresponding process denned by 
(|5.2p . We define the process 



W? = W T + h (s, X^u s ) ds, r G [0, T] 

JthT 



and we note that X u solves the equation 



dy^ = b{r, X™) ds + <t(t, X%) dW?, r G [t, T) , 
X?(0) = x(9), 0€[-r,0]. 



(5.10) 



Since h is bounded, we can apply the Girsanov theorem and deduce that there exists a probability 
measure P" on such that VF" is a Wiener process with respect to P". We remark that, 

by uniqueness, X u is in fact a continuous process adapted to the natural filtration generated by 
W u and augmented by the P n -null sets. In ($7, J 7 , F u ) let us consider the backward equation for 
the unknown process (Y^,Z^), r G [t,T\: 

Y T U + J T ZyW? = <t>(X%)+ J 1 \p(s,X^,Z^ds, re[t,T\. (5.11) 

We notice that the forward-backward system (|5.10p - (|5.11|) has the form (|5.8|) and we conclude 
that Y t u = v (t,x), where v is defined in (|5.9[) : in particular, it does not depend on u(-). 

Now we wish to prove that ZgdW s has finite expectation, equal to zero. By the Burkhol- 
der-Davis-Gundy inequalities, it is enough to prove that 



E 



\Z u A l ds 



1/2 



< CO. 



(5.12) 



We remember that 

d¥ u 



cH 



exp (- £ h (s, Xf, u s ) dW s ~\[ \h 0, X?, 



it.,) I ds 



d¥ u 



We denote by p, and by E" the expectation with respect to P u . We estimate 



E 



T \ 1/2 

\Z 1 t\ 2 ds\ =E U 



\ 1/2 
\Zs\ 2 ds) p- 

1/2 



< ( E u / iZjfds 
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Since the process Z u , solution to (|5.1ip . is square-summable withe respect to P", it remains to 
prove that E" [p~ 2 ] is finite. Noting that 

p- 1 = exp h (s, X?,u a ) dW? -l£\h(s, X u s , u s )\ 2 ds^j , 
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and recalling that h is bounded we get, for some constant C, 



E u [p 
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E n 



exp 2 



h(s,X?,u 8 )dW? 



1 



\h(s,Xf,u s )\ 2 ds 



,u,)\ ds 



expQf 2h(s,X%,u s )dW?-~£ A\h(s,X^ 
ex pQ" 2\h(s,X^,u s )\ 2 ds^ 

<CE U expQf 2h(8,X?,u a )dW?-^J 4\h(s,X^u s )\ 2 ds 
= C. 

Now (I5.12p is proved and therefore Z^dW s has zero expectation with respect to the original 
probability P. If we set r = t in (|5.1ip and we take expectation with respect to P, we obtain 

v (t, x) = E0 (X%) + E J [ij> (s, X%, Z u s ) - Z u s h (s, X?,u a )] ds. 

Adding and subtracting E f^g (u s ) ds we arrive at 

v(t,x) = J(t,x,u(-)) [iP{s,Xf,Z^) - Z^h(s,X^,u s ) - g(u s )]ds. (5.13) 

By the definition of ip the term in the square brackets is non positive and consequently v (t, x) < 
J (t,x,u(-)). □ 
The equality f)5. 13|) can be regarded as a version of the so-called fundamental relation. We 
immediately deduce the following consequences: 

Proposition 5.6 Let t £ [0, T] and x £ C be fixed. If a control u(-) satisfies J (t,x,u(-)) = 
v (t,x) then u(-) is optimal for the control problem starting from x at time t. 

Assume that the set-valued map T has non empty values and it admits a measurable selection 
Tq : [0, T] x C x M. d — > U , and assume that a control u( ) satisfies 

u T = T (r, X", Z") , F-a.s. for almost every r G [t, T) . (5.14) 

Then J (t, x, «(•)) = v (t, x), u(-) is optimal, and the optimal pair (u(-),X u ) satisfies the feedback 
law 



T (t,X?,Vov(t,X?)o-(t,X?)) 



-a.s. for almost every r € [t,T] . 



(5.15) 



We note that ([535]) follows from ([3TT^|) and 

However, we can not prove the existence of an optimal control satisfying (|5.14p (and hence 
(|5.15p ). Such a control can be shown to exist if there exists a solution to the so-called closed- loop 
equation 

dy T = b (r, X T ) dr + a(r, X T )[h(r, X T ,T (r, X T , V v (r, X T ) a (r, X T )))dr + dW T ], r G [t, T] , 
X t {6) = x(9), 0G[-r,O], 

(5.16) 

since in this case one can define an optimal control setting 

u T = T (r, X T , V v(t, X T )a(r, X T )) . 

However, under the present assumptions, we can not guarantee that the closed-loop equation 
has a solution in the usual strong sense. To circumvent this difficulty we will revert to a weak 
formulation of the optimal control problem. 
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5.2 Weak formulation of the optimal control problem 



We formulate the optimal control problem in the weak sense following the approach of [9], see 
e.g. chapter III. The main advantage is that we will be able to solve the closed loop equation in 
a weak sense, and hence to find an optimal control, even if the feedback law is non smooth. 

Initially, we are given the set U and the functions b, a, h, g, <f>. By an admissible control 
system we mean 

where (0, J 7 , (J~t) t>0 , P) is a filtered probability space satisfying the usual conditions, W is an 
Revalued standard Wiener process with respect to (J-f) and P, u is an (JP^-predictable process 
with values in U, X u satisfies (|5.2p -( j5T3j) . An admissible control system will be briefly denoted 
by (W,u,X u ) in the following. Our aim is now to minimize the cost functional 

J(t,x,(W,u,X u )) = E J g(u s ) ds + E<f)(X%) (5.17) 

over all the admissible control systems (W,u,X u ). We can prove the following results: 

Theorem 5.7 Assume that Hypoteses \2.S\ and 15.31 hold true, and that the cost functional is 
given in (5.17). Let v be defined in \5.9}) . Then for every t G [0,T] and x G C and for all 
admissible control system (W, u, X u ) we have 

J(t,x,(W,u,X u )) >v{t,x) 

and the equality holds if and only if 

u T G r(T,X?,V v{T,X?)a(T,X?)), F-a.s. for a.a. r G [t,T] . 

Moreover assume that the set-valued map T has non empty values and it admits a measurable 
selection To : [0, T] x C x M. d — > U. Then an admissible control system (W,u,X u ) satisfying the 
feedback law 

u T = r {X?,V v{T,X?)a(T,X?)), F-a.s. for a.a. r G [t,T] 

is optimal. 

Finally, the closed loop equation A5.16\) admits a weak solution ($7, J 7 , (J-t) t>0 , P, W, X) which 
is unique in law and setting 

u T = T (t,X t ,V v(t,X t )(j(t,X t )) , 
we obtain an optimal admissible control system (W, u, X) . 

Proof. The proof follows from the fundamental relation (|5. 13[) and the same arguments leading 
to Proposition 15.61 and the remarks following it. The only difference here is the solvability of 
the closed loop equation in a weak sense, which is however a standard application of a Girsanov 
change of measure. □ 



6 Parabolic equations and application to a pricing problem 

Let us consider again the Markov process {Xr' x , < t < r <T, x G C}, defined by the formula 
(|2.5p . starting from the family of solutions to equation (|2.4|) . Let us denote by (£-t)t£[o,T] the 
corresponding generator. Thus, each Ct is a second order differential operator acting on a suitable 
domain consisting of real functions defined on C. In the autonomous case, a description of the 
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generator, denoted by C, was given in Section 2, remark f2.4i In this section we treat semilinear 
parabolic equations driven by (Ct), which are generalizations of the Kolmogorov equations. We 
will introduce a concept of solution, called mild solution, that does not require a description of 
the generators. In the sequel the notation £ t will be used only in a formal way. 
The parabolic equations that we study have the following form: 

( dv(tx) + x ) _ ^ X) y ^ x ^ y oU ( i; x ) cr ( i) x )) ; 

dt (6.1) 

k «(r,z) = 4>{x), te[0,T], xeC, 

with unknown function v : [0, T] x C — ► R and given coefficients ip : [0, T] x C x R x R d — > R 
and : C — > R. We recall the notation Vof(£, x) introduced in (|2. 1 1) . We note in particular that 
Vo^(t, x) is a vector in R n whose components are denoted Vg^(t, x) (k = 1, . . . , n). If we denote 
by 0fc(t, x) (A; = 1, . . . , n, i = 1, . . . , d) the components of the matrix a(t, x), then Vou(t, x) <r(i, x) 
denotes the vector in M. d whose components are Ylk=i ^o v {ti x ) cr fc(^' x )i (z = 1, • • • , d)- 

Recalling the definition of the transition semigroup Pt )T given in (|2.6p . and writing the 
variation of constants formula for a solution to (16. ip . we formally obtain 



v(t,x)=P t , T [4>](x)- J P tiT ^(-XT,-),V w(T,>(T,-))](x) dr iG[0,T], xGC. (6.2) 

We notice that this formula is meaningful if Vo^ is well defined and provided <f> and tp satisfy 
some growth and measurability conditions. This way we arrive at the following definition of 
mild solution of the semilinear Kolmogorov equation (|6.1|) . 

Definition 6.1 A function v : [0, T] x C —* R is a mild solution of the semilinear Kolmogorov 
equation \6.1\) if v G Q l ([0, T] x C;R), there exist C > 0, q > s«c/i i/mi 

\v(t,x)\ + \V x v(t,x)\ <C(l + \x\) q , te[0,T],xeC, (6.3) 

and i/ie equality W. j| ) holds. 



The space C/ 1 ([0, T] x C;R) was described in Remark (|2.ip . | V x v(t, x)| denotes the total 
variation norm of the R n -valued finite Borel measure V x v(t,x) on [— r, 0]. 



Theorem 6.1 Assume that Hypotheses \2.2\ and \4-l\ hold true. Then there exists a unique mild 
solution v of h6.1\) . The function v coincides with the one introduced in Corollary \4-3[ 

Proof. At first we prove existence. For fixed t G [0,T] and x G C, let (Xr' x , Yr' x , Z*' x ) Tg [o,T] 
denote the solution of the forward-backward system (|4.ip and let v(t,x) be defined by equality 
(|4.3p . The required regularity and growth conditions of the function v were proved in Corollary 
31 so it remains to prove that v satisfies equality (]6.2p . To this aim we evaluate 

P tjT h/>(- 5 v (r, •) , V v (r, •) a (r, •))] (x) = E (r, «(r, V„w(r, Y^)a(r, X**))] 



where the last equality follows from (14. 4p and (14. 5h . In particular we obtain 

^ Pt,r « (r, •) , V ^ (r, •) a(r, ■)} (x) dr = K j\ (r, X^.' x , Y^' x , Z?) dr. 



(6.4) 
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Since the pair (Y t,x , Z l ' x } is a solution to the backward equation (|4.2p we have 
+ ^ Zl' x dW T = (j> (xjf) + ^ i> (X^, Yp x , Z l T ' x ) dr. 



Taking expectation and applying formula (|6.4p we get the equality (|6.2[) . 

It remains to prove uniqueness. Let v be a mild solution to (|6.1|) . so that for every s G 
[t,T]c[0,T], 

v (s, x) = E0 (X* x ) + IE ip {XP x ,v{t, X°> x ),V v (r, X S T > X ) a(r, X^)) dr. 

Since Xp x is independent on J- s , the expectations occurring in the last formula can be replaced 
by conditional expectations given T s . Next we note that x can be replaced by Xs' x , since X s ' x 

is ^-measurable. Using the identity Xp s = X T ' X , which follows easily from uniqueness of the 
forward equation, we finally obtain 

v {s, Xf x ) = E^<t> (Xlf) + ^ V {X*> x ,v(t, X^),V v (t, X?) a (r, X?)) dr 
= E^ri- E^ J" V [X?, v(t, Xl' x ),V v (r, X^ x ) a (r, X?)) dr, 

where we have defined rj = (f> (x^j + J tp (X^ ,v(t, X t T ' x ),V v (r,X^ x ) a(T,X^ x )) dr. By 
the representation theorem of martingales, see e.g. [6], theorem 8.2, there exists a predictable 
process Z G L 2 (O x [0,T] such that E^t? = y Z r cW r + v (t,x), s G [t,T]. So 



=u(t,z)+ / Z T dW T - ^(X^^^X^^ov^Xt^a^X^dT. (6.5) 

Now we compute the joint quadratic variation with W l of the processes occurring at both 
sides of this equality, on an interval [t,T'] C [t,T). Considering the right-hand side we obtain 
Z\dr by the rules of stochastic calculus. By Theorem 13.11 we have (v(-,X*' x ), W % )ut'] = 
J* t T <r l (r, Xt' x )Vqv (t, Xr' x ) dr. Therefore we have Z T = <j(t, X*' x )Vow(t, Xr' x ) and equality 



6.5p can be rewritten as 

v(s,Xt> x ) = v(t,x) + J" 'v v (r,X^) a(T,X^)dW T 

i> (X^,v(r,X^),V v [t,X$*) <t(t,X?)) dr 



Xfi) ~ [ V v (r, Xp x ) a(r, X*> x )dW T 

rp 

+ / V {Xi> x , v(t, X^^ov (r, X?) a(r, X?)) dr. 

J s 



By comparing with the backward equation in (|4.ip we see that the pairs of processes (Yg ,x , Z t,x 



s 



and yu(s,Xg ),Vqv(s,X s ' )a(s,X s ' )J, s G [t,T], solve the same equation. By uniqueness of 
the solution we have Yg' x = v(s, Xl' x ), s G [t,T], and for s = t we get Y^' x = v (t,x). □ 
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Remark 6.2 The proof of uniqueness is based on an application of Theorem 13.11 Inspection 
of the proof shows that uniqueness holds in a larger class of functions. Namely, if a Borel 
measurable functions v : [0, T] x C — > R satisfies v(t, •) G ^ 1 (C,M) for every t G [0, T], and the 
inequality 

\v(t,x)\ + \V x v(t,x)\ < C(l + \x\) q , t G [0,T],x G J5, 

holds for some C > 0, q > 0, and the equation (|6.2p holds, then u coincides with the solution 
constructed in Theorem 13.11 

Remark 6.3 If in (|6.ip we take as ^ the hamiltonian defined in (|5.6p and as the final cost 
in the cost functional (|5.4p . equation (|6.ip is the Hamilton Jacobi Bellman equation related to 
the stochastic optimal control problem we have treated in section 5. It turns out that the value 
function coincides with the solution of (|6.ip . 



6.1 Application to pricing 

We consider a financial market, of Black and Scholes type, with one risky asset, whose price at 
time t is denoted by St, and one non risky asset, whose price is denoted by Bt- We assume the 
following prices evolution: 

dS t = ii(t, S t+ .) S t dt + a(t, S t+ .) S t dW t , t G [0, T], 
Se = so, G [-r,0], , , 

dB t = pB t dt, te[0,T], 
B = l, 

where p > 0, r > and s G C = C([— r, 0], R). We notice that the coefficients p and a depend 
on the past trajectory: St+- stands for the past trajectory of length r, i.e. St+. = (5t + 6))g g [_ ri0 ]- 
Moreover we consider a contingent claim of the form 

<f>( S T+-), 

where (j) : C — ► R. If r > T then the claim depends on the whole evolution in time of the prices 
of the shares: see [3], [21] or [29J and references within for a general discussion on such kind of 
options, usually referred to as path- dependent. 

We denote by 7Tj the value of the investor's portfolio invested in the risky asset at time t. ir is 
called a trading strategy; we will only consider predictable trading strategies which are square- 
integrable, i.e. E J Q T | -7r £ | 2 cZi < oo. We notice that the value Vt of the corresponding self-financing 
portfolio satisfies the equation 

dV t = pV t dt + n t a(t, S t+ .) 9{t, S t+ .) dt + 7r 4 a{t, S t+ .) dW t , (6.7) 

where 

a( , Q \ v(t,St+.) ~ P 

V{t,i> t+ .) - —7- r— 

a{t,b t +.) 

is called the risk premium. 

At time T the investor has to pay a contingent claim of the form (J)(St+-), where <p : C — > R 
is some given function. The pricing problem is to find and characterize pairs (tt, Vq) consisting 
of a strategy ir and an initial capital Vq G R such that 

V T = </>(S(T + ■)). 

tt is then called a hedging strategy and Vq is called the fair price of the claim at time t = 0. 
Throughout this section we assume the following. 
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Hypothesis 6.4 1. (Wt)t>0 ^s a rea ^ Wiener process defined in a complete probability space 
(f2, J 7 , F) and (J-t)t>o is the filtration generated by W augmented with null sets. 

2. p : [0, T] x C — > R is Borel measurable and bounded, and there exists L > such that 

Ht, f^fiO) - /z(t, / 2 )/ 2 (0)| < L\h - / 2 | c , (6.8) 

for allte [0,T], f 1 , f 2 G C; moreover, /x(t, •) G G l (C,R) for all t £ [0,T]. 

5. a : [0, T] x C —* R is Borel measurable and there exists c > suc/i 

k(t,/)|>c, (6.9) 

/or every f G C, so i/tai i/ie risfc premium in \6. 1\) is well defined and bounded; moreover 

W(t, Z 1 )/ 1 ^) - <r(t, / 2 )/ 2 (0)| < L|/x - / 2 | c , (6.10) 

/or a suitable L > and for all t G [0,T], /\/ 2 G C; finally, cr(i, ■) G ^(C^R) /or a// 
tG[0,T] 

^ G ^(CjR) satisfies \V<p(x)\ < C(l + |x| c ) m /or all x e C and some C > and m>0. 

By the Girsanov theorem there exists a probability measure, called risk- neutral probability, 
for which ^ 

W t = [ 9( T ,S T+ .)dT + W t , te[0,T], 
Jo 

is a Wiener process. Then 

dS t = P S t dt + a(t, S t+ . ) St dW t , dV t = P V t dt + ir t a{t, S t+ . ) dW t . 

The existence of a hedging strategy can be established as follows: using the results of Section [3] 
we first find a solution to the following forward-backward stochastic differential system 

' dS t = pS t dt + a (t, S t +.) S t dW t , t G [0, T] , 

< So +- =s > _ (6 11) 

dV t = pV t dt + Z t dW t , { ' 

k Vt = 4>(St+-)- 

Next, recalling (|6.9p . we note that the required hedging strategy can be recovered from the 
process Z setting ir t = Z t /a(t, St+.). 

However, a better characterization of the hedging strategy and the fair price of the claim can 
be obtained. We first consider, for arbitrary t G [0, T] and s G C, the following forward-backward 
system, which generalizes (|6. 1 1[) : 

' dS T > s = pS 1 / dt + a (r, 5*^.) S T ' S dW T , r G [t, T], 

qt,S _ 

dV^ S = P V^ S dT + Z l f dW T , 

> V T ' S = HSt+X 

with unknown triple (St s , Vt' s , Z T ' S ). Setting X T ,S = 5"*+., then X is a Markov process in C 
with generator C. We finally define 

v(t,s) = V t f ' s , t G [0, T], s G C. 
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It follows from Corollary 14.51 that Zr' s = Vov(t, Xr ,x ) a(r, A*' 2- ). We conclude that the fair price 
and the hedging strategy are uniquely determined as 



V o = v(0,s), 



Z 



0,s 



7T* 



V v(t,X?' s ) = V v(t,S t+ .). 



a(t,X?' s ) 



Moreover, see Theorem 16.11 v(t,s) is characterized as the unique mild solution of the equation 

' dv(t,x) 
— — h Lv{t,x) = rv(t,x), 

dt (6.12) 
k u(T,x) = <t>(x), t € [0,T], x € C, 
which can be considered as a generalization of the Black-Scholes equation to the present setting. 
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